Overview
Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 39737 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 4.9 MiB |
| Average record size in memory | 128.0 B |
Variable types
| Text | 3 |
|---|---|
| Numeric | 9 |
| Categorical | 2 |
| DateTime | 1 |
Alerts
| Alert | Type |
|---|---|
| latitude is highly overall correlated with neighborhood_group | High correlation |
| longitude is highly overall correlated with neighborhood_group | High correlation |
| neighborhood_group is highly overall correlated with latitude and longitude | High correlation |
| number_of_reviews is highly overall correlated with reviews_per_month | High correlation |
| reviews_per_month is highly overall correlated with number_of_reviews | High correlation |
Reproduction
| Analysis started | 2025-01-30 13:50:32.238024 |
|---|---|
| Analysis finished | 2025-01-30 13:50:56.754838 |
| Duration | 24.52 seconds |
| Software version | ydata-profiling v4.12.2 |
| Download configuration | config.json |
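The reported duration can be cross-checked against the two timestamps in the Reproduction table. The following is a minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2025-01-30 13:50:32.238024")
finished = datetime.fromisoformat("2025-01-30 13:50:56.754838")

# Elapsed wall-clock time in seconds, rounded to two decimals as in the report.
duration_seconds = round((finished - started).total_seconds(), 2)
print(f"Duration: {duration_seconds} seconds")
```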
Variables
name
Text
| Distinct | 39017 |
|---|---|
| Distinct (%) | 98.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 620.9 KiB |
Length
| Max length | 179 |
|---|---|
| Median length | 70 |
| Mean length | 36.427461 |
| Min length | 1 |
Unique
| Unique | 38545 |
|---|---|
| Unique (%) | 97.0% |
Sample
| 1st row | Skylit Midtown Castle |
|---|---|
| 2nd row | THE VILLAGE OF HARLEM....NEW YORK ! |
| 3rd row | Cozy Entire Floor of Brownstone |
| 4th row | Entire Apt: Spacious Studio/Loft by central park |
| 5th row | Large Cozy 1 BR Apartment In Midtown East |
| Value | Count | Frequency (%) |
| in | 14237 | 6.0% |
| room | 8893 | 3.7% |
| private | 6344 | 2.7% |
| bedroom | 6282 | 2.6% |
|  | 6068 | 2.5% |
| apartment | 5658 | 2.4% |
| cozy | 4486 | 1.9% |
| apt | 3718 | 1.6% |
| brooklyn | 3547 | 1.5% |
| the | 3266 | 1.4% |
| Other values (10349) | 176699 |
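The table above appears to list the most frequent lowercase tokens in the name column. The sketch below shows one way such counts could be produced, assuming lowercasing and whitespace splitting; the profiler's exact tokenization may differ. The five strings are the sample rows shown earlier, and `names` and `word_counts` are illustrative names.

```python
from collections import Counter

# The five sample values of the "name" column shown in the Sample table.
names = [
    "Skylit Midtown Castle",
    "THE VILLAGE OF HARLEM....NEW YORK !",
    "Cozy Entire Floor of Brownstone",
    "Entire Apt: Spacious Studio/Loft by central park",
    "Large Cozy 1 BR Apartment In Midtown East",
]

# Assumed tokenization: lowercase each value, then split on whitespace.
word_counts = Counter(
    token
    for name in names
    for token in name.lower().split()
)
total_tokens = sum(word_counts.values())

# Print the most frequent tokens with their share of all tokens.
for token, count in word_counts.most_common(4):
    print(f"{token}: {count} ({100 * count / total_tokens:.1f}%)")
```

On the full column the same approach would be applied to every row rather than to five samples.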
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 200767 | 13.9% |
| e | 100522 | 6.9% |
| o | 100458 | 6.9% |
| t | 85893 | 5.9% |
| a | 85139 | 5.9% |
| r | 79916 | 5.5% |
| i | 77788 | 5.4% |
| n | 76899 | 5.3% |
| l | 41994 | 2.9% |
| m | 40929 | 2.8% |
| Other values (725) | 557213 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1447518 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1447518 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1447518 |
host_id
Real number (ℝ)
| Distinct | 32367 |
|---|---|
| Distinct (%) | 81.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 66218061 |
| Minimum | 2571 |
|---|---|
| Maximum | 2.7432131 × 10⁸ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 620.9 KiB |
Quantile statistics
| Minimum | 2571 |
|---|---|
| 5-th percentile | 809681.2 |
| Q1 | 7824750 |
| median | 30736639 |
| Q3 | 1.0361186 × 10⁸ |
| 95-th percentile | 2.396412 × 10⁸ |
| Maximum | 2.7432131 × 10⁸ |
| Range | 2.7431874 × 10⁸ |
| Interquartile range (IQR) | 95787113 |
Descriptive statistics
| Standard deviation | 77502126 |
|---|---|
| Coefficient of variation (CV) | 1.1704077 |
| Kurtosis | 0.30851014 |
| Mean | 66218061 |
| Median Absolute Deviation (MAD) | 27174331 |
| Skewness | 1.2494698 |
| Sum | 2.6313071 × 10¹² |
| Variance | 6.0065795 × 10¹⁵ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 219517861 | 144 | 0.4% |
| 190921808 | 38 | 0.1% |
| 119669058 | 34 | 0.1% |
| 213781715 | 31 | 0.1% |
| 224414117 | 29 | 0.1% |
| 417504 | 23 | 0.1% |
| 252604696 | 20 | 0.1% |
| 134184451 | 18 | < 0.1% |
| 201015598 | 17 | < 0.1% |
| 159091490 | 17 | < 0.1% |
| Other values (32357) | 39366 |
| Value | Count | Frequency (%) |
| 2571 | 1 | < 0.1% |
| 2787 | 5 | |
| 2845 | 2 | < 0.1% |
| 2881 | 2 | < 0.1% |
| 3151 | 1 | < 0.1% |
| 3211 | 1 | < 0.1% |
| 3415 | 1 | < 0.1% |
| 3563 | 1 | < 0.1% |
| 3647 | 2 | < 0.1% |
| 3867 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 274321313 | 1 | |
| 274311461 | 1 | |
| 274307600 | 1 | |
| 274298453 | 1 | |
| 274273284 | 1 | |
| 274225617 | 1 | |
| 274195458 | 1 | |
| 274188386 | 1 | |
| 274103383 | 1 | |
| 274040642 | 1 |
host_name
Text
| Distinct | 10347 |
|---|---|
| Distinct (%) | 26.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 620.9 KiB |
Length
| Max length | 35 |
|---|---|
| Median length | 31 |
| Mean length | 6.0734832 |
| Min length | 1 |
Unique
| Unique | 6299 |
|---|---|
| Unique (%) | 15.9% |
Sample
| 1st row | Jennifer |
|---|---|
| 2nd row | Elisabeth |
| 3rd row | LisaRoxanne |
| 4th row | Laura |
| 5th row | Chris |
| Value | Count | Frequency (%) |
|  | 877 | 2.0% |
| and | 510 | 1.2% |
| michael | 365 | 0.8% |
| david | 359 | 0.8% |
| john | 268 | 0.6% |
| alex | 260 | 0.6% |
| sarah | 220 | 0.5% |
| maria | 209 | 0.5% |
| daniel | 202 | 0.5% |
| jessica | 182 | 0.4% |
| Other values (9354) | 40569 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 31078 | 12.9% |
| e | 23296 | 9.7% |
| i | 20143 | 8.3% |
| n | 19494 | 8.1% |
| r | 14173 | 5.9% |
| l | 12539 | 5.2% |
| o | 10024 | 4.2% |
| t | 7653 | 3.2% |
| s | 7608 | 3.2% |
| h | 7577 | 3.1% |
| Other values (176) | 87757 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 241342 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 241342 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 241342 |
neighborhood_group
Categorical
High correlation 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 620.9 KiB |
| Brooklyn | 17347 |
|---|---|
| Manhattan | 16009 |
| Queens | 5027 |
| Bronx | 1010 |
| Staten Island | 344 |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 8.1168936 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Manhattan |
|---|---|
| 2nd row | Manhattan |
| 3rd row | Brooklyn |
| 4th row | Manhattan |
| 5th row | Manhattan |
Common Values
| Value | Count | Frequency (%) |
| Brooklyn | 17347 | 43.7% |
| Manhattan | 16009 | 40.3% |
| Queens | 5027 | 12.7% |
| Bronx | 1010 | 2.5% |
| Staten Island | 344 | 0.9% |
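The Frequency (%) column can be derived from the counts alone. The sketch below recomputes each borough's share of the 39737 observations from the counts in the table above, assuming the report rounds to one decimal place; `counts` and `shares` are illustrative names.

```python
# Counts copied from the Common Values table for neighborhood_group.
counts = {
    "Brooklyn": 17347,
    "Manhattan": 16009,
    "Queens": 5027,
    "Bronx": 1010,
    "Staten Island": 344,
}

# The counts should add up to the number of observations in the dataset.
total = sum(counts.values())

# Share of each category in percent, rounded to one decimal place.
shares = {name: round(100 * count / total, 1) for name, count in counts.items()}

for name, share in shares.items():
    print(f"{name}: {share}%")
```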
Words
| Value | Count | Frequency (%) |
| brooklyn | 17347 | 43.3% |
| manhattan | 16009 | 39.9% |
| queens | 5027 | 12.5% |
| bronx | 1010 | 2.5% |
| staten | 344 | 0.9% |
| island | 344 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 56090 | 17.4% |
| a | 48715 | 15.1% |
| o | 35704 | 11.1% |
| t | 32706 | 10.1% |
| r | 18357 | 5.7% |
| B | 18357 | 5.7% |
| l | 17691 | 5.5% |
| y | 17347 | 5.4% |
| k | 17347 | 5.4% |
| M | 16009 | 5.0% |
| Other values (10) | 44218 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 322541 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 322541 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 322541 |
neighborhood
Text
| Distinct | 219 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 620.9 KiB |
Length
| Max length | 26 |
|---|---|
| Median length | 17 |
| Mean length | 11.921635 |
| Min length | 4 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Midtown |
|---|---|
| 2nd row | Harlem |
| 3rd row | Clinton Hill |
| 4th row | East Harlem |
| 5th row | Murray Hill |
| Value | Count | Frequency (%) |
| east | 5353 | 8.4% |
| side | 3410 | 5.4% |
| williamsburg | 3363 | 5.3% |
| harlem | 3277 | 5.2% |
| bedford-stuyvesant | 3242 | 5.1% |
| heights | 3155 | 5.0% |
| upper | 2664 | 4.2% |
| village | 2509 | 3.9% |
| bushwick | 2153 | 3.4% |
| west | 1957 | 3.1% |
| Other values (231) | 32443 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 43042 | 9.1% |
| i | 33346 | 7.0% |
| s | 32865 | 6.9% |
| a | 31054 | 6.6% |
| t | 31027 | 6.5% |
| l | 27898 | 5.9% |
| r | 27565 | 5.8% |
| (space) | 23789 | 5.0% |
| n | 21420 | 4.5% |
| o | 20070 | 4.2% |
| Other values (44) | 181654 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 473730 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 473730 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 473730 |
latitude
Real number (ℝ)
High correlation 
| Distinct | 17805 |
|---|---|
| Distinct (%) | 44.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.727575 |
| Minimum | 40.49979 |
|---|---|
| Maximum | 40.91306 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 620.9 KiB |
Quantile statistics
| Minimum | 40.49979 |
|---|---|
| 5-th percentile | 40.643716 |
| Q1 | 40.68808 |
| median | 40.72008 |
| Q3 | 40.76326 |
| 95-th percentile | 40.828082 |
| Maximum | 40.91306 |
| Range | 0.41327 |
| Interquartile range (IQR) | 0.07518 |
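The Range and Interquartile range (IQR) rows follow directly from the other quantile statistics. A minimal sketch, using the latitude values from the table above and rounding to the five decimals shown in the report; the variable names are illustrative.

```python
# Quantile statistics for latitude, copied from the table above.
minimum, maximum = 40.49979, 40.91306
q1, q3 = 40.68808, 40.76326

# IQR is the spread of the middle 50% of values; range is the full spread.
iqr = round(q3 - q1, 5)
value_range = round(maximum - minimum, 5)

print(f"IQR: {iqr}, range: {value_range}")
```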
Descriptive statistics
| Standard deviation | 0.056292643 |
|---|---|
| Coefficient of variation (CV) | 0.0013821752 |
| Kurtosis | 0.060462811 |
| Mean | 40.727575 |
| Median Absolute Deviation (MAD) | 0.03622 |
| Skewness | 0.29695214 |
| Sum | 1618391.7 |
| Variance | 0.0031688616 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40.71813 | 17 | < 0.1% |
| 40.70766 | 11 | < 0.1% |
| 40.68444 | 11 | < 0.1% |
| 40.71353 | 11 | < 0.1% |
| 40.68634 | 11 | < 0.1% |
| 40.69414 | 11 | < 0.1% |
| 40.68374 | 10 | < 0.1% |
| 40.69454 | 10 | < 0.1% |
| 40.68683 | 10 | < 0.1% |
| 40.72085 | 10 | < 0.1% |
| Other values (17795) | 39625 |
| Value | Count | Frequency (%) |
| 40.49979 | 1 | |
| 40.50641 | 1 | |
| 40.50708 | 1 | |
| 40.50868 | 1 | |
| 40.50873 | 1 | |
| 40.50943 | 1 | |
| 40.51133 | 1 | |
| 40.52211 | 1 | |
| 40.52293 | 1 | |
| 40.527 | 1 |
| Value | Count | Frequency (%) |
| 40.91306 | 1 | |
| 40.91234 | 1 | |
| 40.91167 | 1 | |
| 40.90804 | 1 | |
| 40.90734 | 1 | |
| 40.90484 | 1 | |
| 40.90406 | 1 | |
| 40.90391 | 1 | |
| 40.90356 | 1 | |
| 40.90329 | 1 |
longitude
Real number (ℝ)
High correlation 
| Distinct | 13979 |
|---|---|
| Distinct (%) | 35.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.949145 |
| Minimum | -74.24442 |
|---|---|
| Maximum | -73.71299 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 39737 |
| Negative (%) | 100.0% |
| Memory size | 620.9 KiB |
Quantile statistics
| Minimum | -74.24442 |
|---|---|
| 5-th percentile | -74.00262 |
| Q1 | -73.98104 |
| median | -73.95332 |
| Q3 | -73.93217 |
| 95-th percentile | -73.85607 |
| Maximum | -73.71299 |
| Range | 0.53143 |
| Interquartile range (IQR) | 0.04887 |
Descriptive statistics
| Standard deviation | 0.047708551 |
|---|---|
| Coefficient of variation (CV) | -0.00064515352 |
| Kurtosis | 4.6640199 |
| Mean | -73.949145 |
| Median Absolute Deviation (MAD) | 0.02496 |
| Skewness | 1.2118019 |
| Sum | -2938517.2 |
| Variance | 0.0022761059 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -73.95332 | 16 | < 0.1% |
| -73.95677 | 16 | < 0.1% |
| -73.94791 | 16 | < 0.1% |
| -73.9506 | 15 | < 0.1% |
| -73.9435 | 14 | < 0.1% |
| -73.94537 | 14 | < 0.1% |
| -73.95551 | 14 | < 0.1% |
| -73.98439 | 14 | < 0.1% |
| -73.95427 | 14 | < 0.1% |
| -73.95136 | 14 | < 0.1% |
| Other values (13969) | 39590 |
| Value | Count | Frequency (%) |
| -74.24442 | 1 | |
| -74.24285 | 1 | |
| -74.24084 | 1 | |
| -74.23986 | 1 | |
| -74.23914 | 1 | |
| -74.23803 | 1 | |
| -74.23059 | 1 | |
| -74.21238 | 1 | |
| -74.21017 | 1 | |
| -74.20941 | 1 |
| Value | Count | Frequency (%) |
| -73.71299 | 1 | |
| -73.7169 | 1 | |
| -73.71795 | 1 | |
| -73.71829 | 1 | |
| -73.71928 | 1 | |
| -73.72173 | 1 | |
| -73.72179 | 1 | |
| -73.72247 | 1 | |
| -73.72435 | 1 | |
| -73.72581 | 1 |
room_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 620.9 KiB |
| Private room | 19865 |
|---|---|
| Entire home/apt | 18882 |
| Shared room | 990 |
Length
| Max length | 15 |
|---|---|
| Median length | 12 |
| Mean length | 13.400609 |
| Min length | 11 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Entire home/apt |
|---|---|
| 2nd row | Private room |
| 3rd row | Entire home/apt |
| 4th row | Entire home/apt |
| 5th row | Entire home/apt |
Common Values
| Value | Count | Frequency (%) |
| Private room | 19865 | 50.0% |
| Entire home/apt | 18882 | 47.5% |
| Shared room | 990 | 2.5% |
Words
| Value | Count | Frequency (%) |
| room | 20855 | 26.2% |
| private | 19865 | 25.0% |
| entire | 18882 | 23.8% |
| home/apt | 18882 | 23.8% |
| shared | 990 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 60592 | 11.4% |
| r | 60592 | 11.4% |
| e | 58619 | 11.0% |
| t | 57629 | 10.8% |
| m | 39737 | 7.5% |
| a | 39737 | 7.5% |
| (space) | 39737 | 7.5% |
| i | 38747 | 7.3% |
| h | 19872 | 3.7% |
| P | 19865 | 3.7% |
| Other values (7) | 97373 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 532500 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 532500 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 532500 |
price
Real number (ℝ)
| Distinct | 318 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 119.02768 |
| Minimum | 10 |
|---|---|
| Maximum | 334 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 620.9 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 40 |
| Q1 | 65 |
| median | 100 |
| Q3 | 155 |
| 95-th percentile | 250 |
| Maximum | 334 |
| Range | 324 |
| Interquartile range (IQR) | 90 |
Descriptive statistics
| Standard deviation | 67.161063 |
|---|---|
| Coefficient of variation (CV) | 0.56424743 |
| Kurtosis | 0.26883518 |
| Mean | 119.02768 |
| Median Absolute Deviation (MAD) | 40 |
| Skewness | 0.9609735 |
| Sum | 4729803 |
| Variance | 4510.6084 |
| Monotonicity | Not monotonic |
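Several of the descriptive statistics are linked by simple identities: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. The sketch below checks these for price using the rounded values printed in the report, so small rounding differences are expected; the variable names are illustrative.

```python
import math

# Values copied from the report (price column, 39737 observations).
n_observations = 39737
mean = 119.02768
std = 67.161063

# Derived statistics.
cv = std / mean                # coefficient of variation
variance = std ** 2            # variance is the squared standard deviation
total = mean * n_observations  # sum of all values

print(f"CV: {cv:.6f}, variance: {variance:.4f}, sum: {total:.0f}")

# Compare against the reported figures, allowing for rounding in the inputs.
assert math.isclose(cv, 0.56424743, rel_tol=1e-5)
assert math.isclose(variance, 4510.6084, rel_tol=1e-5)
assert math.isclose(total, 4729803, rel_tol=1e-5)
```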
| Value | Count | Frequency (%) |
| 100 | 1855 | 4.7% |
| 150 | 1782 | 4.5% |
| 50 | 1353 | 3.4% |
| 60 | 1310 | 3.3% |
| 75 | 1276 | 3.2% |
| 200 | 1239 | 3.1% |
| 80 | 1164 | 2.9% |
| 65 | 1084 | 2.7% |
| 70 | 1064 | 2.7% |
| 120 | 999 | 2.5% |
| Other values (308) | 26611 |
| Value | Count | Frequency (%) |
| 10 | 16 | |
| 11 | 3 | < 0.1% |
| 12 | 3 | < 0.1% |
| 13 | 1 | < 0.1% |
| 15 | 5 | < 0.1% |
| 16 | 5 | < 0.1% |
| 18 | 2 | < 0.1% |
| 19 | 3 | < 0.1% |
| 20 | 28 | |
| 21 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 334 | 1 | < 0.1% |
| 333 | 6 | < 0.1% |
| 332 | 1 | < 0.1% |
| 331 | 1 | < 0.1% |
| 330 | 24 | 0.1% |
| 329 | 10 | < 0.1% |
| 328 | 3 | < 0.1% |
| 327 | 1 | < 0.1% |
| 325 | 111 | |
| 324 | 2 | < 0.1% |
minimum_nights
Real number (ℝ)
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.6962025 |
| Minimum | 1 |
|---|---|
| Maximum | 11 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 620.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 7 |
| Maximum | 11 |
| Range | 10 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.8635408 |
|---|---|
| Coefficient of variation (CV) | 0.69117241 |
| Kurtosis | 2.4854943 |
| Mean | 2.6962025 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.5375781 |
| Sum | 107139 |
| Variance | 3.4727843 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 12066 | 30.4% |
| 2 | 11080 | 27.9% |
| 3 | 7375 | 18.6% |
| 4 | 3066 | 7.7% |
| 5 | 2821 | 7.1% |
| 7 | 1951 | 4.9% |
| 6 | 679 | 1.7% |
| 10 | 462 | 1.2% |
| 8 | 127 | 0.3% |
| 9 | 79 | 0.2% |
| Value | Count | Frequency (%) |
| 1 | 12066 | |
| 2 | 11080 | |
| 3 | 7375 | |
| 4 | 3066 | 7.7% |
| 5 | 2821 | 7.1% |
| 6 | 679 | 1.7% |
| 7 | 1951 | 4.9% |
| 8 | 127 | 0.3% |
| 9 | 79 | 0.2% |
| 10 | 462 | 1.2% |
| Value | Count | Frequency (%) |
| 11 | 31 | 0.1% |
| 10 | 462 | 1.2% |
| 9 | 79 | 0.2% |
| 8 | 127 | 0.3% |
| 7 | 1951 | 4.9% |
| 6 | 679 | 1.7% |
| 5 | 2821 | 7.1% |
| 4 | 3066 | 7.7% |
| 3 | 7375 | |
| 2 | 11080 |
number_of_reviews
Real number (ℝ)
High correlation 
| Distinct | 391 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.728478 |
| Minimum | 1 |
|---|---|
| Maximum | 629 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 620.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 17 |
| Q3 | 31.728478 |
| 95-th percentile | 124 |
| Maximum | 629 |
| Range | 628 |
| Interquartile range (IQR) | 27.728478 |
Descriptive statistics
| Standard deviation | 45.964693 |
|---|---|
| Coefficient of variation (CV) | 1.4486889 |
| Kurtosis | 17.831984 |
| Mean | 31.728478 |
| Median Absolute Deviation (MAD) | 14.728478 |
| Skewness | 3.4592964 |
| Sum | 1260794.5 |
| Variance | 2112.753 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 31.72847802 | 6701 | 16.9% |
| 1 | 4047 | 10.2% |
| 2 | 2794 | 7.0% |
| 3 | 2033 | 5.1% |
| 4 | 1631 | 4.1% |
| 5 | 1288 | 3.2% |
| 6 | 1137 | 2.9% |
| 7 | 969 | 2.4% |
| 8 | 959 | 2.4% |
| 9 | 809 | 2.0% |
| Other values (381) | 17369 |
| Value | Count | Frequency (%) |
| 1 | 4047 | |
| 2 | 2794 | |
| 3 | 2033 | |
| 4 | 1631 | |
| 5 | 1288 | 3.2% |
| 6 | 1137 | 2.9% |
| 7 | 969 | 2.4% |
| 8 | 959 | 2.4% |
| 9 | 809 | 2.0% |
| 10 | 666 | 1.7% |
| Value | Count | Frequency (%) |
| 629 | 1 | |
| 607 | 1 | |
| 597 | 1 | |
| 594 | 1 | |
| 576 | 1 | |
| 543 | 1 | |
| 540 | 1 | |
| 510 | 1 | |
| 488 | 1 | |
| 480 | 1 |
last_review
Date
| Distinct | 1696 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 620.9 KiB |
| Minimum | 2011-03-28 00:00:00 |
|---|---|
| Maximum | 2019-07-08 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
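The minimum and maximum above bound the period covered by last_review. A short sketch of computing that span with the standard library; the variable names are illustrative.

```python
from datetime import date

# Earliest and latest last_review values, copied from the table above.
first_review = date(2011, 3, 28)
last_review = date(2019, 7, 8)

# Number of days between the two dates.
span_days = (last_review - first_review).days
print(f"last_review spans {span_days} days (about {span_days / 365.25:.1f} years)")
```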
reviews_per_month
Real number (ℝ)
High correlation 
| Distinct | 937 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.4637252 |
| Minimum | 0.01 |
|---|---|
| Maximum | 58.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 620.9 KiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 0.04 |
| Q1 | 0.29 |
| median | 1.25 |
| Q3 | 1.89 |
| 95-th percentile | 4.562 |
| Maximum | 58.5 |
| Range | 58.49 |
| Interquartile range (IQR) | 1.6 |
Descriptive statistics
| Standard deviation | 1.5899264 |
|---|---|
| Coefficient of variation (CV) | 1.0862192 |
| Kurtosis | 51.348297 |
| Mean | 1.4637252 |
| Median Absolute Deviation (MAD) | 0.88 |
| Skewness | 3.3957793 |
| Sum | 58164.047 |
| Variance | 2.5278658 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.373251377 | 6701 | 16.9% |
| 0.02 | 811 | 2.0% |
| 0.05 | 737 | 1.9% |
| 1 | 721 | 1.8% |
| 0.03 | 666 | 1.7% |
| 0.16 | 537 | 1.4% |
| 0.04 | 518 | 1.3% |
| 0.08 | 486 | 1.2% |
| 0.09 | 452 | 1.1% |
| 0.06 | 450 | 1.1% |
| Other values (927) | 27658 |
| Value | Count | Frequency (%) |
| 0.01 | 30 | 0.1% |
| 0.02 | 811 | |
| 0.03 | 666 | |
| 0.04 | 518 | |
| 0.05 | 737 | |
| 0.06 | 450 | |
| 0.07 | 359 | |
| 0.08 | 486 | |
| 0.09 | 452 | |
| 0.1 | 357 |
| Value | Count | Frequency (%) |
| 58.5 | 1 | |
| 27.95 | 1 | |
| 20.94 | 1 | |
| 19.75 | 1 | |
| 17.82 | 1 | |
| 16.81 | 1 | |
| 16.22 | 1 | |
| 16.03 | 1 | |
| 15.78 | 1 | |
| 15.32 | 1 |
calculated_host_listings_count
Real number (ℝ)
| Distinct | 26 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.0729798 |
| Minimum | 1 |
|---|---|
| Maximum | 327 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 620.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 327 |
| Range | 326 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 19.744108 |
|---|---|
| Coefficient of variation (CV) | 6.425069 |
| Kurtosis | 259.63234 |
| Mean | 3.0729798 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.026103 |
| Sum | 122111 |
| Variance | 389.82979 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 27692 | 69.7% |
| 2 | 5834 | 14.7% |
| 3 | 2454 | 6.2% |
| 4 | 1142 | 2.9% |
| 5 | 677 | 1.7% |
| 6 | 403 | 1.0% |
| 7 | 310 | 0.8% |
| 8 | 263 | 0.7% |
| 9 | 151 | 0.4% |
| 327 | 144 | 0.4% |
| Other values (16) | 667 | 1.7% |
| Value | Count | Frequency (%) |
| 1 | 27692 | |
| 2 | 5834 | 14.7% |
| 3 | 2454 | 6.2% |
| 4 | 1142 | 2.9% |
| 5 | 677 | 1.7% |
| 6 | 403 | 1.0% |
| 7 | 310 | 0.8% |
| 8 | 263 | 0.7% |
| 9 | 151 | 0.4% |
| 10 | 139 | 0.3% |
| Value | Count | Frequency (%) |
| 327 | 144 | |
| 47 | 38 | 0.1% |
| 43 | 3 | < 0.1% |
| 34 | 34 | 0.1% |
| 33 | 31 | 0.1% |
| 30 | 29 | 0.1% |
| 28 | 23 | 0.1% |
| 26 | 13 | < 0.1% |
| 20 | 20 | 0.1% |
| 18 | 18 | < 0.1% |
availability_365
Real number (ℝ)
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 160.52594 |
| Minimum | 1 |
|---|---|
| Maximum | 365 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 620.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 90 |
| median | 160.52594 |
| Q3 | 179 |
| 95-th percentile | 354 |
| Maximum | 365 |
| Range | 364 |
| Interquartile range (IQR) | 89 |
Descriptive statistics
| Standard deviation | 96.482529 |
|---|---|
| Coefficient of variation (CV) | 0.6010401 |
| Kurtosis | -0.29181954 |
| Mean | 160.52594 |
| Median Absolute Deviation (MAD) | 44.525944 |
| Skewness | 0.3979936 |
| Sum | 6378819.4 |
| Variance | 9308.8784 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 160.5259438 | 15685 | 39.5% |
| 365 | 755 | 1.9% |
| 1 | 353 | 0.9% |
| 5 | 307 | 0.8% |
| 89 | 304 | 0.8% |
| 3 | 275 | 0.7% |
| 364 | 273 | 0.7% |
| 179 | 239 | 0.6% |
| 2 | 234 | 0.6% |
| 90 | 231 | 0.6% |
| Other values (356) | 21081 |
| Value | Count | Frequency (%) |
| 1 | 353 | |
| 2 | 234 | |
| 3 | 275 | |
| 4 | 210 | |
| 5 | 307 | |
| 6 | 227 | |
| 7 | 192 | |
| 8 | 215 | |
| 9 | 177 | |
| 10 | 151 |
| Value | Count | Frequency (%) |
| 365 | 755 | |
| 364 | 273 | 0.7% |
| 363 | 180 | 0.5% |
| 362 | 123 | 0.3% |
| 361 | 82 | 0.2% |
| 360 | 84 | 0.2% |
| 359 | 119 | 0.3% |
| 358 | 120 | 0.3% |
| 357 | 66 | 0.2% |
| 356 | 63 | 0.2% |
Interactions
Correlations
| | availability_365 | calculated_host_listings_count | host_id | latitude | longitude | minimum_nights | neighborhood_group | number_of_reviews | price | reviews_per_month | room_type |
|---|---|---|---|---|---|---|---|---|---|---|---|
| availability_365 | 1.000 | 0.163 | 0.031 | -0.029 | 0.054 | -0.081 | 0.095 | 0.038 | -0.007 | -0.035 | 0.122 |
| calculated_host_listings_count | 0.163 | 1.000 | 0.135 | -0.049 | 0.134 | -0.150 | 0.052 | 0.120 | -0.192 | 0.198 | 0.063 |
| host_id | 0.031 | 0.135 | 1.000 | 0.040 | 0.146 | -0.184 | 0.109 | -0.072 | -0.110 | 0.245 | 0.098 |
| latitude | -0.029 | -0.049 | 0.040 | 1.000 | 0.056 | -0.023 | 0.542 | -0.004 | 0.114 | -0.007 | 0.097 |
| longitude | 0.054 | 0.134 | 0.146 | 0.056 | 1.000 | -0.085 | 0.654 | 0.050 | -0.401 | 0.109 | 0.125 |
| minimum_nights | -0.081 | -0.150 | -0.184 | -0.023 | -0.085 | 1.000 | 0.053 | -0.099 | 0.118 | -0.211 | 0.122 |
| neighborhood_group | 0.095 | 0.052 | 0.109 | 0.542 | 0.654 | 0.053 | 1.000 | 0.022 | 0.172 | 0.045 | 0.089 |
| number_of_reviews | 0.038 | 0.120 | -0.072 | -0.004 | 0.050 | -0.099 | 0.022 | 1.000 | 0.008 | 0.706 | 0.027 |
| price | -0.007 | -0.192 | -0.110 | 0.114 | -0.401 | 0.118 | 0.172 | 0.008 | 1.000 | -0.016 | 0.491 |
| reviews_per_month | -0.035 | 0.198 | 0.245 | -0.007 | 0.109 | -0.211 | 0.045 | 0.706 | -0.016 | 1.000 | 0.021 |
| room_type | 0.122 | 0.063 | 0.098 | 0.097 | 0.125 | 0.122 | 0.089 | 0.027 | 0.491 | 0.021 | 1.000 |
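The High correlation alerts in the Overview can be related back to this matrix. The sketch below filters a handful of coefficients copied from the table using a cutoff of 0.5. That cutoff is an assumption, not something the report states; it is chosen because it is consistent with the alerts listed (price and room_type at 0.491 are not flagged). `correlations` and `flagged` are illustrative names.

```python
# A few coefficients copied from the correlation matrix above.
correlations = {
    ("latitude", "neighborhood_group"): 0.542,
    ("longitude", "neighborhood_group"): 0.654,
    ("number_of_reviews", "reviews_per_month"): 0.706,
    ("price", "room_type"): 0.491,
    ("longitude", "price"): -0.401,
    ("host_id", "reviews_per_month"): 0.245,
}

# Assumed cutoff for a "high" correlation (not stated in the report).
HIGH_CORRELATION_THRESHOLD = 0.5

# Keep pairs whose absolute coefficient reaches the cutoff, strongest first.
flagged = sorted(
    (pair for pair, value in correlations.items()
     if abs(value) >= HIGH_CORRELATION_THRESHOLD),
    key=lambda pair: abs(correlations[pair]),
    reverse=True,
)

for left, right in flagged:
    print(f"{left} <-> {right}: {correlations[(left, right)]:.3f}")
```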
Missing values
Sample
First rows
| id | name | host_id | host_name | neighborhood_group | neighborhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2595 | Skylit Midtown Castle | 2845 | Jennifer | Manhattan | Midtown | 40.75362 | -73.98377 | Entire home/apt | 225.0 | 1 | 45.000000 | 2019-05-21 | 0.380000 | 2 | 355.000000 |
| 3647 | THE VILLAGE OF HARLEM....NEW YORK ! | 4632 | Elisabeth | Manhattan | Harlem | 40.80902 | -73.94190 | Private room | 150.0 | 3 | 31.728478 | 2019-07-08 | 1.373251 | 1 | 365.000000 |
| 3831 | Cozy Entire Floor of Brownstone | 4869 | LisaRoxanne | Brooklyn | Clinton Hill | 40.68514 | -73.95976 | Entire home/apt | 89.0 | 1 | 270.000000 | 2019-07-05 | 4.640000 | 1 | 194.000000 |
| 5022 | Entire Apt: Spacious Studio/Loft by central park | 7192 | Laura | Manhattan | East Harlem | 40.79851 | -73.94399 | Entire home/apt | 80.0 | 10 | 9.000000 | 2018-11-19 | 0.100000 | 1 | 160.525944 |
| 5099 | Large Cozy 1 BR Apartment In Midtown East | 7322 | Chris | Manhattan | Murray Hill | 40.74767 | -73.97500 | Entire home/apt | 200.0 | 3 | 74.000000 | 2019-06-22 | 0.590000 | 1 | 129.000000 |
| 5178 | Large Furnished Room Near B'way | 8967 | Shunichi | Manhattan | Hell's Kitchen | 40.76489 | -73.98493 | Private room | 79.0 | 2 | 430.000000 | 2019-06-24 | 3.470000 | 1 | 220.000000 |
| 5203 | Cozy Clean Guest Room - Family Apt | 7490 | MaryEllen | Manhattan | Upper West Side | 40.80178 | -73.96723 | Private room | 79.0 | 2 | 118.000000 | 2017-07-21 | 0.990000 | 1 | 160.525944 |
| 5238 | Cute & Cozy Lower East Side 1 bdrm | 7549 | Ben | Manhattan | Chinatown | 40.71344 | -73.99037 | Entire home/apt | 150.0 | 1 | 160.000000 | 2019-06-09 | 1.330000 | 4 | 188.000000 |
| 5295 | Beautiful 1br on Upper West Side | 7702 | Lena | Manhattan | Upper West Side | 40.80316 | -73.96545 | Entire home/apt | 135.0 | 5 | 53.000000 | 2019-06-22 | 0.430000 | 1 | 6.000000 |
| 5441 | Central Manhattan/near Broadway | 7989 | Kate | Manhattan | Hell's Kitchen | 40.76076 | -73.98867 | Private room | 85.0 | 2 | 188.000000 | 2019-06-23 | 1.500000 | 1 | 39.000000 |
Last rows
| id | name | host_id | host_name | neighborhood_group | neighborhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 36482809 | Stunning Bedroom NYC! Walking to Central Park!! | 131529729 | Kendall | Manhattan | East Harlem | 40.79633 | -73.93605 | Private room | 75.0 | 2 | 31.728478 | 2019-07-08 | 1.373251 | 2 | 353.0 |
| 36483010 | Comfy 1 Bedroom in Midtown East | 274311461 | Scott | Manhattan | Midtown | 40.75561 | -73.96723 | Entire home/apt | 200.0 | 6 | 31.728478 | 2019-07-08 | 1.373251 | 1 | 176.0 |
| 36483152 | Garden Jewel Apartment in Williamsburg New York | 208514239 | Melki | Brooklyn | Williamsburg | 40.71232 | -73.94220 | Entire home/apt | 170.0 | 1 | 31.728478 | 2019-07-08 | 1.373251 | 3 | 365.0 |
| 36484087 | Spacious Room w/ Private Rooftop, Central location | 274321313 | Kat | Manhattan | Hell's Kitchen | 40.76392 | -73.99183 | Private room | 125.0 | 4 | 31.728478 | 2019-07-08 | 1.373251 | 1 | 31.0 |
| 36484363 | QUIT PRIVATE HOUSE | 107716952 | Michael | Queens | Jamaica | 40.69137 | -73.80844 | Private room | 65.0 | 1 | 31.728478 | 2019-07-08 | 1.373251 | 2 | 163.0 |
| 36484665 | Charming one bedroom - newly renovated rowhouse | 8232441 | Sabrina | Brooklyn | Bedford-Stuyvesant | 40.67853 | -73.94995 | Private room | 70.0 | 2 | 31.728478 | 2019-07-08 | 1.373251 | 2 | 9.0 |
| 36485057 | Affordable room in Bushwick/East Williamsburg | 6570630 | Marisol | Brooklyn | Bushwick | 40.70184 | -73.93317 | Private room | 40.0 | 4 | 31.728478 | 2019-07-08 | 1.373251 | 2 | 36.0 |
| 36485431 | Sunny Studio at Historical Neighborhood | 23492952 | Ilgar & Aysel | Manhattan | Harlem | 40.81475 | -73.94867 | Entire home/apt | 115.0 | 10 | 31.728478 | 2019-07-08 | 1.373251 | 1 | 27.0 |
| 36485609 | 43rd St. Time Square-cozy single bed | 30985759 | Taz | Manhattan | Hell's Kitchen | 40.75751 | -73.99112 | Shared room | 55.0 | 1 | 31.728478 | 2019-07-08 | 1.373251 | 6 | 2.0 |
| 36487245 | Trendy duplex in the very heart of Hell's Kitchen | 68119814 | Christophe | Manhattan | Hell's Kitchen | 40.76404 | -73.98933 | Private room | 90.0 | 7 | 31.728478 | 2019-07-08 | 1.373251 | 1 | 23.0 |